A New Method for Text-Line Segmentation for Warped Documents
Abstract
Bound documents, whether scanned or captured with digital cameras, often exhibit a geometrical warp that curls the text-lines. Identifying the text-lines is one of the steps in document de-warping when only a single image is available. This paper presents a new method for text-line segmentation. It is based on a simple but effective skew detector proposed by Ávila and Lins, and it simplifies the idea of coupled snakes introduced by Bukhari to a moving parallel line regression. The proposed method performed better than the best of the similar algorithms in the literature.
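The abstract does not spell out the regression step, so the following is only one possible reading of a "moving parallel line regression", not the authors' algorithm. It assumes that the upper and lower boundary points of a text-line are already available per column (the names `xs`, `y_top`, `y_bottom`, `window` and `step` are hypothetical), fits two lines that share a single slope by least squares, and repeats the fit in a window that slides along the line so that a curled line is approximated piecewise.

```python
def fit_parallel_lines(xs, y_top, y_bottom):
    """Least-squares fit of two lines that share one slope.

    Models y_top ~ slope * x + top_intercept and
    y_bottom ~ slope * x + bottom_intercept on the same x positions.
    Returns (slope, top_intercept, bottom_intercept).
    """
    n = len(xs)
    x_mean = sum(xs) / n
    top_mean = sum(y_top) / n
    bottom_mean = sum(y_bottom) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct x values")
    # Both boundaries contribute to the shared slope estimate.
    sxy = sum(
        (x - x_mean) * ((t - top_mean) + (b - bottom_mean))
        for x, t, b in zip(xs, y_top, y_bottom)
    )
    slope = sxy / (2 * sxx)
    return slope, top_mean - slope * x_mean, bottom_mean - slope * x_mean


def moving_parallel_fit(xs, y_top, y_bottom, window=40, step=20):
    """Slide a window along x and fit a parallel pair inside each window.

    Returns a list of (window_start, slope, top_intercept, bottom_intercept).
    The window and step sizes are illustrative assumptions.
    """
    fits = []
    start, end = min(xs), max(xs)
    while start <= end:
        idx = [i for i, x in enumerate(xs) if start <= x < start + window]
        # Skip windows that cannot determine a slope.
        if len({xs[i] for i in idx}) >= 2:
            fit = fit_parallel_lines(
                [xs[i] for i in idx],
                [y_top[i] for i in idx],
                [y_bottom[i] for i in idx],
            )
            fits.append((start, *fit))
        start += step
    return fits
```

On a curled line the slope returned for successive windows changes gradually, which is the local orientation information a de-warping step would need. Sharing the slope between the two boundaries is a design choice of this sketch: it keeps the pair parallel with a single closed-form solve per window.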
Similar resources
Fast Restoration of Warped Document Image based on Text Rectangle Area Segmentation
Warping usually makes documents hard to recognize. Specifically, when a page of a thick book or bound document is copied with a digital photocopier, the resulting image is usually warped because of the thickness of the document. We focus on this problem and propose a fast method to restore the warped document image in this paper. The text rectangle area of the document is one of the...
A New Algorithm for Detecting Text Line in Handwritten Documents
Curvilinear text line detection and segmentation in handwritten documents is a significant challenge for handwriting recognition. Given no prior knowledge of script, we model text line detection as an image segmentation problem by enhancing text line structure using a Gaussian window, and adopting the level set method to evolve text line boundaries. Experiments show that the proposed method ach...
Segmentation of Handwritten and Printed Arabic Documents
In this paper, we propose a new text line segmentation method for handwritten and typewritten Arabic document images that uses the Outer Isothetic Cover (OIC) algorithm of a digital object. In the first step, we use this method to segment the composed document into text blocks. In the second step, we extract the text lines from each text block. Finally, each text line is segmented into words or into...
Text line segmentation in handwritten documents using Mumford-Shah model
Text line segmentation in handwritten documents is an important step in document processing. We present a new text line segmentation method based on the Mumford-Shah model. The algorithm is script independent. In addition, we use morphing to remove overlaps between neighboring text lines and connect broken ones. Experimental results show the validity of our method.